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A LINEAR /c-FOLD CHEEGER INEQUALITY 


FRANKLIN RENTER AND MARY RACLIFFE 


Abstract. Given an undirected graph G, the classical Cheeger constant, Kq^ measures the optimal partition 
of the vertices into 2 parts with relatively few edges between them based upon the sizes of the parts. The well- 
known Cheeger’s inequality states that 2Ai < Hq < y/2Xi where Ai is the minimum nontrivial eigenvalue 
of the normalized Laplacian matrix. 

Recent work has generalized the concept of the Cheeger constant when partitioning the vertices of a 
graph into k > 2 parts. While there are several approaches, recent results have shown these higher-order 
Cheeger constants to be tightly controlled by Xk—i, the (fc — 1)^*^ nontrivial eigenvalue, to within a quadratic 
factor. 

We present a new higher-order Cheeger inequality with several new perspectives. First, we use an alter¬ 
native higher-order Cheeger constant which considers an “average case” approach. We show this measure 
is related to the average of the first k — 1 nontrivial eigenvalues of the normalized Laplacian matrix. Fur¬ 
ther, using recent techniques, our results provide linear inequalities using the oo-norms of the corresponding 
eigenvectors. Consequently, unlike previous results, this result is relevant even when Afc_i 1. 


1. Introduction 

Let G — (V^E) be an undirected graph, and let C = be the normalized Laplacian 

of G with eigenvalues 0 = Aq < Ai < ... < A^-i- It is a basic fact in spectral graph theory that — 0 
if and only if G has at least k connected components. Additionally, if Ai ~ 0 then the vertices of G can 
be partitioned into 2 parts, nearly disconnected from one another. This is formalized through the Cheeger 
constant and the Cheeger inequality. 

The classical Cheeger constant is defined as 


ho = inf /i(5), where h(S) = 
SCV{G) 


eiS,S) 


min{Vol(S'),Vol(5')}’ 


where e(5', S) is the number of edges between S and its complement, and Vol (S) is the sum of vertex degrees 
in S. The classical Cheeger inequality relates he to the first eigenvalue of the normalized Laplacian matrix, 
as follows. 

Cheeger Inequality (see for example [3]). Let Ai be the first nontrivial eigenvalue of a connected graph 
G. Then 

< he < \/2Ai. 

Recently, some strengthenings of Cheeger’s inequality have appeared (see, for example, mw)- Moreover, 
several recent results have generalized to a so-called “higher order” Cheeger constant (see, for example 
[arani]) by considering a partition of V(G) into k > 2 parts. While there are several different definitions 
of a fc**' order Cheeger constant, one approach is to define the fc-fold cheeger constant to be 

= inf max/i(S'i) 

S i 

where the infimum ranges over all partitions of vertices S = {S'!, 82 , ■ ■ ■ Sk}- In this case, we have: 

Higher-Order Cheeger Inequality (Lee, Gharan and Trevisan, [9]). Let Xk-i be the {k— I)*** nontrivial 
eigenvalue of a connected graph G. Then 


< Oik^)./2X^. 
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This result formally demonstrates that if G can be partitioned into k parts which are nearly disconnected 

from one another, then « 0. Similar results for a variant of can be found in m- 

The Cheeger constant and associated spectral information can be used to find clusters in graphs; that is, 
subgraphs that are highly connected. This has been a topic of wide interest in both the mathematics and 
computer science literature (see, for example, [H [5l HI [11] [121 [131 HI] > among many others). 

This article expands upon previous work on the Cheeger constant in two ways. First, we work with the 
following new notion of a fc-fold Cheeger constant. For a given partition S = {^i, ^ 2 ,..., 5'^} oiV (G), define 
the Cheeger constant of the partition, to be 

^ _ 

k min{Vol(S'i) ,¥01(5),)} 

We then define the k*‘^ Cheeger constant of G to be = inishQ^S). Specifically, while previous work 
focused on generalizing the Cheeger constant using a “worst case” approach, we consider the alternative 
“average case” approach. That is, Hq requires all sets in a partition to have a small Cheeger ratio, whereas 

(k) 

Hq can be small even if a small number of the sets have a large Cheeger ratio. We here reproduce a lower 

(k) 

bound for Hq' that agrees with the standard Cheeger inequality when k = 2. Second, we extend upon 
previous work of the first author |7] which gives a linear upper bound at the expense of using eigenvector 
norms. We prove: 


Theorem 1. Fix a constant k. Let G be an undirected graph on n vertices, with maximum degree A, and 
suppose there exists a constant /3 > 0 such that = o(n~^). Let 0 = Aq < Ai < • • • < Afc_i be the first 

k eigenvalues of C, with corresponding harmonic eigenvectors xo,xi,... ,Xk-i, and suppose that Xk-i < 1- 
Let a = max{||a:i||oo | i = 1, 2,..., fc — 1}, and let A = ^ ~ ^i)- Then the k-fold Cheeger eonstant 

satisfies 


1 

2 



< 


1 

2 


1 

4k 


{k - 1)A 
4Vol(G) 


(1 + 0 ( 1 )). 


In addition, if we do not have Afc-i < 1, we have the following related theorem. 


Theorem 2. Fix a constant k. Let G be an undirected graph on n vertices, with maximum degree A, and 
suppose there exists a constant /3 > 0 such that = o{n~^^. Let 0 = Aq < Ai < • • • < Afc_i be the first 

k eigenvalues of C, with corresponding harmonic eigenvectors xo,xi,. .. ,Xk-i- Let a = ||a:z||oo; and 

let A = X — Ai). Then the k-fold Cheeger constant h^'^ satisfies 


1 A 
-< 

2 2 - 


fik) 


< 


1 

2 


1 

4k 


{k - 1)A 
4Vol(G) 


(1 + 0 ( 1 )). 


These results have several interesting features. First, the upper bounds of previous higher-order Cheeger 
inequalities are generally not applicable when Afc ;+ 1 /k'^ (consider the complete graph, for example). 
In contrast, under mild conditions. Theorem [T] or [5] applies even if Afc ;+ 1/fc^. Additionally, the result 
demonstrates that the “average case” A:-fold Cheeger constant is tightly controlled by the average of the 
first fc — 1 nontrivial eigenvalues whereas the previous “worst case” approaches tightly control hf, with Afc. 
Finally, and perhaps most interesting, Theorem [T] shows that the Cheeger ratio can be elegantly bounded to 
within a linear factor of the corresponding eigenvalues when the eigenvector norms are considered. 

We note that although the bound in Theorem [^ appears much weaker than that of Theorem [1] it is in 
fact not necessarily weaker at all. Indeed, if A is negative, that is, if the average of the first k — 1 nontrivial 
eigenvalues is greater than 1, then Theorem [2] gives a stronger bound than Theorem [T] as the term involving 
A will be positive in this case. Indeed, as seen in Section [5] the bound given in Theorem [2] is quite good for 
the complete graph Kn- 

We present this article as follows. In Sectional we give relevant background and definitions. Then, we 
prove the lower bounds of Theorems [T] and [2] in Section [3] and the upper bounds in Section [4| Finally, we 
conclude with the example of applying our result to Kn in Section [5] 
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2. Preliminaries 


To prove the upper bound in Theoreni[TJ we shall use tools from both probability theory and graph theory. 
To begin, we define our graph-theoretic notation. 

Given a graph G, define the adjacency matrix A to be the square matrix, indexed by P(G), with = 
lur^v, the indicator of whether {u,!;} S E{G). Define D to be the diagonal matrix indexed by V{G) with 
D«,ii = deg( 3 (u). For simplicity of notation, if the graph is understood, we write du = degQ{u). The 
normalized Laplacian matrix, £, is given by £ = — A)D“^/^. By convention, if G has an isolated 

vertex u, set = 0. The eigenvalues of £ will be written as 0 = Aq < Ai < • • • < A„_i. For a 

matrix or vector X, we let X* denote the conjugate transpose of X. 

For a subset S C V{G), define Vol (S') = Define Vol (G) = Vol(P(G)). Write Is to denote the 

vector indexed by V{G) with Is('c) = the indicator of whether v is an element of S. For convenience 

of notation, we write 1 = 1 v(g)- Given two subsets S,T C V{G), define e{S,T) to be the number of edges 
with one incident vertex in S and the other incident vertex in T. It is a standard exercise in graph theory 
to verify the following: 

Proposition 1. For S,T subsets ofV{G), we have 

e(S, T) = (Di/2ig)*(i _ 

Given an eigenvalue-eigenvector pair (A, v) for £, define the harmonic eigenvector corresponding to A 
to be Harmonic eigenvectors can be a useful tool for analyzing the normalized Laplacian. More 

information about harmonic eigenvectors and their uses can be found in [3], for example. For any graph G, 
there is always an orthonormal basis of eigenvectors vq, Vi,..., v„_i; we shall assume throughout that such 
a basis has been chosen, and the harmonic eigenvectors used have been derived from this basis, so that the 
harmonic eigenvector corresponding to Xi will be precisely D" We note that vq = (Vol (G)) 

In addition, we shall require the following tools from probability theory. Given a random variable X, we 
use E [A] to denote the expected value of A. If A is an event in a probability space, we use 1^ to denote 
the 0—1 indicator random variable for A. Given a matrix M whose entries are all random variables, E [M] 
denotes the matrix of entry-wise expectation. We shall use the following result from [7]: 

Proposition 2. Let x £ C" be a random vector whose entries are pairwise independent, and let p. = M [x]. 
If A is an n X n symmetric matrix with An = 0 for all i, then 

E [x*Ax] = /i*A/i. 

Given a random variable A, we say A has a Bernoulli distribution with parameter p if P(A = 1) = p, 
and P(A = 0) = 1 — p. We write A ~ Ber (p). 

In addition, we shall make use of Chernoff bounds. Chernoff bounds are a class of concentration inequalities 
that consider sums of independent random variables. Often the variables considered are Bernoulli, though 
that may not be the case here. There are many versions of Ghernoff bounds (see, for example, m); we shall 
use the following: 

Proposition 3. For i = 1,2,... ,k, let Xi be a nonnegative random variable with Xi < A. Let S = 'YfXi, 
and let fi = M [S']. Then for any e > 0, 

P(|S - p| > £/i) < 2 exp • 

In addition, to prove the lower bound in Theorems [T] and [21 we shall make use of the following linear 
algebra theorem. This result can be derived as a corollary of the Gourant-Fischer Theorem, and can be 
found, for example, as Corollary 4.3.18 in [^. 

Theorem 3. Let M. be an n x n Hermitian matrix with eigenvalues Ao < Ai < • • • < A„_i. Fix k <n, and 
let Un,k denote the set of n x k complex matrices with orthonormal columns. Then 

fe-i 

V A* = min tr (U*MU). 
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Rephrased, this theorem states that if fi, f 2 ,..., ffc is a collection of orthonormal vectors in C" and M is 
a Hermitian matrix, then 

k k—1 

i—1 i—0 


3. The Lower Bound 


In this section, we prove the lower bound from Theorems [T] and [2] Recall that A = ^ Si=i (1 “ the 
average of the largest k eigenvalues of I — 


Theorem 4. Given k >2, 


/iW > ^ 


1 A 


‘G - 


2 2 


Proof. Let S = {S'!, S 2 , ■ ■ ■, Sk} be a partition of the vertices of G. For 1 < j < k, let gj = ^ Ig^. 

y Voi(Si) 

Note that as the Si are all disjoint, we have that the set {g^} is orthogonal. Moreover, \\gj\\^ = J2veSi Voi{S ) ~ 
1, and hence the set {gj} is in fact orthonormal. Then by Theorem [3l we have 

k k— 1 

'Y^{g*Cgi) > ^ Ai = fc(l - A). 


2=1 


2=0 


On the other hand, by Proposition [H we have 


= g*gi - g* (i - ^)gt = i - 


Vol (S,) ■ 


Combining these two results yields 


1 - A < 


1 


EL- 


^2=1 

k 


- r EP- 


e{S^,S.) 

Vol (5,) 

+ ^ e{S,,S,) 


k 


Vol {Si) 


E 




Vol (5,) 


- r EP- 


Vol(5,) ^e{S,,S,) 

Vol (5,) ^ Vol (5,) 


l^f e{S,,S,) e{S,,S,) 

h2^\ Vol(5'i) Vol(5'j) 


< 2 h 


(k) 
G ’ 


as desired. 


□ 


4. The Upper Bound 

The proof of the upper bound in Theorem [T] is modeled after the proof of a similar upper bound by the 
first author in [7] when k = 2. The strategy employed is to choose a vector randomly so that the expectation 
of this vector has useful algebraic properties, and apply concentration results for the expectation. 

Proof of T/ieorem [7J upper bound. Let xi,X 2 ,... ,Xfe_i be the first k — 1 nontrivial harmonic eigenvectors 
for G, with corresponding eigenvalues Ai, A 2 ,..., A/c_i. As above, we shall assume that Xi = 
where the set {vq, vi,..., v„} is an orthonormal basis for R." composed of eigenvectors of £. Recall that 
vn = ^ D^/^1, and hence for each i, 

^ x,(u)d„ = X*D1 = yvol (G)(D-V2 v,)*dV2vo = ^/Vol (G)v*vo = 0, 
dGU(G) 

so each x^ is orthogonal to D\. Let a = max{||xj||oo | J = 1, 2,..., k}. 
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Let <5 > 0 be a constant that will be defined later. For all v G V(G), define a random variable Sy by 


Sy = j with probability 2 \k% + 2 (fc-i)|| 2 j||„o for j = 1, 2 ,..., fc - 1 
Sy = k otherwise 

Note that the random variables for different vertices are independent. For each j = 1, 2,..., fc, define 
Sj = {t! G V I Sy = j}. Thus we can view the random variables Sy as partitioning the vertices of G into 
k sets. Let Wj = , the indicator vector for the set Sj. As the choice of set Sj for each vertex v is 

independent of each other vertex, we have that the entries in Wj are pairwise independent (although any 
pair Wj, Wf are not independent). Notice that for j = 1, 2,..., fc — 1, we have 

E [Vol (Sj)] = ^ P(z; e Sj)dy 

vGV 

= 5 ((4^) 2(t- 

by orthogonality of Xj to Dl. Note that p, is independent of the choice of j. 

On the other hand, for a given j < k — 1, we can view the vertices in Sj as chosen by a sequence of 

independent Bernoulli random variables, Xy, where Xy ^ Ber ^ 2 (fc-i) + 2 (fc^i)||x-|| —)• Thus, Vol (S'j) = 
J2vev(G) dvXy, and by Proposition |3l we have that for all £ > 0, 


P(| Vol {Sj) — /r| > ey) < 2 exp 



where A is the maximum degree of a vertex in G. 

Let A be the event that (1 — e)^ < Vol {Sj) < (1 +e)/i for all 1 < j < fc — 1. By the union bound, we have 

P(A^)<2(fc-l)exp(^^). 


Thus for each j , we have 


E [e{S„S,)] = E [e{S„Si)lA + e{Sj,S,)l^] 


< 


E [e(^„ + Vol (G) (^2{k - 1) exp ) 

e{Sj,S,) 


■Yo\{S,)1a 


[Vol(5,) ^ 


+ 2{k - 1) Vol (G) exp ( 


9 

e 


Vol(^) 

Yol{S,)-e{S„^,) 


Vol(5,)l^ 


+ 2(fc-l)Vol(G)exp(-^ 


2 

e fi 


YoI{S,)1a 


Yol{S,) 

Using linearity of expectation, we can sum over j to obtain 


+ 2{k- 1) Vol (G) exp ( ) . 
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E 


k-l 

Y,e{S,,S,) 


k-l r 


^ E 

i=i 


E 


Yo\{S,)-e{S„S,) 


< (1 + e)fj, E 

= (l+e)/iE 

= (l+£)/iE 


YoliS,) 


i=i 

/ 


YoliS,)J 

k-l 


k-l _ V V 1 


Vol (S'j) 1a + 2(fc - 1) Vol (G) exp (-eV/(3A)) 

+ 2{k - if Vol (G) exp (-eV/(3A)) 

+ 2{k - i f Vol (G) exp (-eV/(3A)) 


1a 




fc-l-V 


fe{S„Sf ^eiS,,Sf\ 


k-l 

E 


<s„sA 


\Yo\iSj) Yol {SfJ ^ Vol(5fc) I 
+2{k — 1)^ Vol (G) exp (3A)) 

2e{Sj,Si) l-£ 


1a 


< (1 + e)fi E 


/fc- i-V — 

f ^ mi 




min{Vol(5i),Vol(5j)} 1 + e 


+ 1 1a 


+2{k — 1)^ Vol (G) exp (3A)) 

(fe)l-e 


< (1 + e)fi I k — 2kh 


1 + e 


+ 2{k — 1)^ Vol (G) exp (—e^^/(3A)) . 


On the other hand, we also have Wj = Ig., 


E [wj] = 


1 - 25 
2{k-l) 


1 + 


2||x,l| 


•J l|oo 


and thus, as the entries in w^- are independent as noted above, by Proposition [51 we have 


E[e(5„5,)] = E[(Di/2w,r(I-£)(Di/2w,)' 


1 - 25 
2{k-l) 
1-25 


1 + 


1 


We therefore obtain 

Vol (G) + 


^ 1 / / -I QC \ 2 


1 = 1 


,2(k-l) 


2(k-l) 
1-A 


Vol (G) + 


2 ||xj ||oo 

1 - A,- 


xj) D1/2(i_/:)d1/2 


1 - 25 
2{k-l) 


1 + 


2||Xj||oo 


4||x,|lL' 


4||x,||^ 


< (1 + e)^ — 2kf'Q ~ (3A)) ■ 


Recall that = o{n by hypothesis; choose 5 = e = n Then we have 

exp (-£V/(3A)) = exp 

for G an appropriate constant. As Vol (G) < n^, we thus have that the error term satisfies 

2(fc — 1)^ Vol (G) exp (—£^/r/(3A)) < 2{k — l)^n^ exp(—G(1 — o(l))n^/^) = o(l). 
Moreover, as ||xj||oo > a and 1 — A^- > 0 for all j, we have 


( 1 ) 


k-l 

E 


1 — A, ^ fcA 


;^4||x,||^ -4a2- 


Therefore, 









































k-1 

E 

i=i 


1 - 2S 
2 {k-l) 


Vol (G) 


1 - A, 


4||x 
Vol (G) + 


2 

■J Iloo 


< (1 + e)fi ( k — 2kh 


1 + s 


^( 1 ) 


( 1 -2(5)2 kA Vol(G) 


4(fc- 1) ' ' 4a2 - ^ ^ "2{k-l) 


Solving for h^'’ yields 


as desired. 




1 1 


- o(^)) 2 {k'-l) “ ‘^khP{l - o(l))) + o(l) 

(fc- 1)A 


_2 4fc 4Vol(G)a2 


(1 + 0 ( 1 )), 


□ 


To obtain the proof of Theorem [2l we note that the only inequality that fails above when Afe_i > 1 is ([T]). 
To correct for this problem, we shall slightly modify the definition of the random variable Sv for each v. 


Proof of Theorem\^ upper bound. As in the proof of Theorem [U Let Xi, X 2 ,..., Xfc_i be the first k — 1 
nontrivial harmonic eigenvectors for G, with corresponding eigenvalues Ai < A 2 < • • • < Afc_i. Let a = 
HiZi l|x*||oo- Let ^ > 0, and for all v £ V{G), define a random variable Sy by 

Sy = j with probability ^0^ + 2 (k-i)a for j = 1, 2,..., fc - 1 
Sy = k otherwise 


Proceed with the proof as in Theorem [TJ noting that we can replace inequality © with the equality 

k-1 




J = 1 


4a2 


kA 


and that all else is unchanged. The result then follows. 


□ 


5. Example 


As an example of an application of Theorem [51 we consider the complete graph Kn. Take fc to be a hxed 

(k) 

constant, and we shall consider the asymptotics of Kq as n 00 . 

As with the standard Cheeger constant, it is quite clear to see that will be minimized when the 

Si are roughly an equipartition of n, that is, when there are exactly r = nmodk sets of size and the 
rest are of size . Letting S be such a partition, we have 



Recall that the Laplacian eigenvalues of Ar„ are Aq = 0 and Ai = for all i > 0. Therefore, the lower 
bound given in Theorem |T] yields > ^(fc — l);j^ 1 ~ ^ estimate for the Cheeger ratio. 

Moreover, A = ■^^(—;;;^). Also, we have vq, the eigenvector corresponding to 0, is given by vq = -^1. 
Thus any vector perpendicular to vq is an eigenvector for Note that as A is negative here, we wish 

to maximize in order to minimize the upper bound. Thus we take = -^(e 2 i_i — 62 ^). It is clear 
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that these are orthonormal. In addition, this implies that the harmonic eigenvectors are = D = 

- 62 .), and hence 

Therefore, we obtain as the upper bound from Theorem [5] 



compared to the true constant ^ 


1 

4fc 


(fc- 1 )A 
4Vol(G) a 2 


(1 + 0 ( 1 )) 


1 

4fc 4n(n-l)|^ 



(1 + 0 ( 1 )), 


(1 + 0 ( 1 )) 
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